PPO robot


StepScorer: Accelerating Reinforcement Learning with Step-wise Scoring and Psychological Regret Modeling

Feb 03, 2026

Uncertainty-Aware Non-Prehensile Manipulation with Mobile Manipulators under Object-Induced Occlusion

Feb 02, 2026

Towards Bridging the Gap between Large-Scale Pretraining and Efficient Finetuning for Humanoid Control

Jan 29, 2026

GPO: Growing Policy Optimization for Legged Robot Locomotion and Whole-Body Control

Jan 28, 2026

D-Optimality-Guided Reinforcement Learning for Efficient Open-Loop Calibration of a 3-DOF Ankle Rehabilitation Robot

Jan 22, 2026

Adaptive Reinforcement and Model Predictive Control Switching for Safe Human-Robot Cooperative Navigation

Jan 23, 2026

ReWorld: Multi-Dimensional Reward Modeling for Embodied World Models

Jan 18, 2026

Closing the Reality Gap: Zero-Shot Sim-to-Real Deployment for Dexterous Force-Based Grasping and Manipulation

Jan 06, 2026

Global End-Effector Pose Control of an Underactuated Aerial Manipulator via Reinforcement Learning

Dec 24, 2025

On Swarm Leader Identification using Probing Policies

Dec 20, 2025